Lang Feng

Beyond Trajectory-Level Attribution: Graph-Based Credit Assignment for Agentic Reinforcement Learning

May 26, 2026
Via arXiv

PrismFlow: Residual Dynamics for Flow Matching in Time-Series Generation

May 22, 2026
Via arXiv

Hierarchy-of-Groups Policy Optimization for Long-Horizon Agentic Tasks

Feb 26, 2026
Via arXiv

Online Causal Kalman Filtering for Stable and Effective Policy Optimization

Feb 11, 2026
Via arXiv

Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems

Feb 09, 2026
Via arXiv

AnomSeer: Reinforcing Multimodal LLMs to Reason for Time-Series Anomaly Detection

Feb 09, 2026
Via arXiv

AgentOCR: Reimagining Agent History via Optical Self-Compression

Jan 08, 2026
Via arXiv

CaveAgent: Transforming LLMs into Stateful Runtime Operators

Jan 04, 2026
Via arXiv

TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning

Jun 16, 2025
Via arXiv

Group-in-Group Policy Optimization for LLM Agent Training

May 16, 2025
Via arXiv